Skip to main content

SHOW DETAILS
up-solid down-solid
eye
Title
Date Published
Review
The Dataset Collection
favoritefavoritefavoritefavoritefavorite Jul 1, 2015
data
eye 50,469
favorite 8
comment 3
favoritefavoritefavoritefavoritefavorite

Find the dataset available for instant analysis in BigQuery and queries on this reddit...

(Here is the original Reddit comment announcing this collection of data and what the processes were.) This is an archive of Reddit comments from October of 2007 until May of 2015 (complete month). This reflects 14 months of work and a lot of API calls. This dataset includes nearly every publicly available Reddit comment. Approximately 350,000 comments out of ~1.65 billion were unavailable due to Reddit API issues. Q: How are the files structured? Each file is compressed with bzip2 compression....

Find the dataset available for instant analysis in BigQuery and queries on this reddit...